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ABSTRACT 

We present a new accurate measurement of the H I mass function of galaxies from the HIPASS Bright Galaxy 
Catalog, a sample of 1000 galaxies with the highest Hi peak flux densities in the southern {5 <Q°) hemi- 
sphere (Koribalski et al. 2003). This sample spans nearly four orders of magnitude in Hi mass (from 
log(MH/M0) + 21og/i7g = 6.8 to 10.6) and is the largest sample of Hi selected galaxies to date. We develop 
a bivariate maximum likelihood technique to measure the space density of galaxies, and show that this is a 
robust method, insensitive to the effects of large scale structure. The resulting H I mass function can be fitted 
satisfactorily with a Schechter function with faint-end slope a = -1.30. This slope is found to be dependent 
on morphological type, with later type galaxies giving steeper slopes. We extensively test various effects that 
potentially bias the determination of the H I mass function, including peculiar motions of galaxies, large scale 
structure, selection bias, and inclination effects, and quantify these biases. The large sample of galaxies enables 
an accurate measurement of the cosmological mass density of neutral gas: 57hi = (3.8 ±0.6) x lQ~'^hj\. Low 
surface brightness galaxies contribute only ~ 15% to this value, consistent with previous findings. 

Subject headings: Galaxies: ISM, Galaxies: Luminosity Function, Mass Function, Radio Lines: Galaxies, 
Surveys, ISM: General 



1. introduction 

Estimates of the Hi mass function (HiMF), the distribu- 
tion function of galaxies as a function of H I mass, have been 
based on very small samples of galaxies. The most exten- 
sive blind H I surveys to date, the Arecibo H I Strip Survey 
(AHISS, Zwaan et al. 1997) and the Arecibo Dual-Beam Sur- 
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vey (ADBS, Rosenberg & Schneider 2002) resulted in 66 and 
265 galaxy detections, respectively. On the contrary, state of 
the art measurements of the optical equivalent of the HiMF, 
the galaxy luminosity function (LF), are based on samples of 
typically a few x 10 galaxies (FoUces et al. 1999, NorlDerg et 
al. 2001, 2dF; Blanton et al. 2001, SDSS). These large area, 
low redshift surveys have resulted in a very accurate census 
of the local galaxy population and have shown that the power 
law faint-end slope of the LF is 'flat' (a = -1.2) down to the 
lowest luminosities {MB-5\ogh-is < -15). The shape of the 
LF and the derived integral cosmological luminosity density 
pi provide important constraints to galaxy evolution models. 
However, for an accurate assessment of the distribution and 
content of baryons in the universe, it is important to also mea- 
sure the HiMF with high accuracy. 

The H I Parkes All Sky Survey (HIPASS) is a blind survey 
of the whole southern sky south of 5 = 25° in the velocity 
range -1200 <cz< 12700 kms"'. Analysis of the extragalac- 
tic component of the survey is ongoing and the final galaxy 
sample is estimated to have ^ 7000 entries. This deep cata- 
log is expected to make important contributions to the map- 
ping and understanding of large scale structure, the interpre- 
tation of QSO absorption-lines, analysis of galaxy groups and 
clusters, etc. Here we use the first product of the HIPASS 
database, the HIPASS Bright Galaxy Catalog (BGC, Korib- 
alski et al. 2003), which consists of the 1000 galaxies with 
the highest peak flux densities. This sample is four times as 
large as the next largest H I selected galaxy sample (ADBS) 
and therefore enables a more careful analysis of the shape of 
the HiMF as well as its dependence on various galaxy param- 
eters. 

An accurate measurement of the HiMF is highly relevant 
to several modern astrophysical problems. Firstly, the HiMF 



2 



Zwaan et al. 



provides a measurement of the cosmological mass density of 
H I, f^Hi, in the local universe. This is an important bench- 
mark in the mapping out of the evolution of the neutral gas 
mass density from before the epoch of reionization, when 
the vast majority of baryons was in the form of H I, to the 
present epoch where the mass in stars outweighs that in neu- 
tral gas (Lanzetta, Wolfe & Turnshek 1995, Storrie-Lombardi 
& Wolfe 2000). At intermediate redshifts, damped Ly-a sys- 
tems are used to trace the evolution of fini, but because of 
their low number density dN/dz at low z, damped Ly-a can 
not be used to accurately measure flei at z = 0. 

Secondly, the shape of the HiMF faint-end slope provides 
useful input to galaxy formation models. Both the LF and 
the HiMF are required to link the local galaxy population 
to the model prediction, since both functions measure differ- 
ent baryonic components of galaxies. The HiMF measures 
the distribution of OTa.s-.s- of the innate cool gas in galaxies, 
whereas the LF describes the distribution of light emission of 
processed material (i.e. stars), which is non-trivially linked 
to its mass. Many dwarf galaxies have large gas fractions 
(e.g., Roberts & Haynes 1994), which means that the HiMF is 
closely related to the low mass end of the total mass function 
of galaxies. Of course, this relation breaks down in regions of 
high galaxy density, where gas poor dwarf elUptical galaxies 
dominate the number counts of low mass galaxies. Measuring 
the low mass end of the mass function is specifically interest- 
ing with regard to Cold Dark Matter (CDM) theory, which 
predicts an abundance of low mass objects, which might be 
detectable in 21cm. However, the detectability could be de- 
creased if gas is ejected by early supemovae (Dekel & Silk 
1986; Babul & Rees 1992; Babul & Ferguson 1996) or photo- 
evaporized by the cosmic UV background during reionization 
(Barkana & Loeb 1999). Also at lower redshifts can ioniza- 
tion by the UV background of lower column density regions 
of H I disks have an effect on the slope of the HiMF (Corbelli, 
Salpeter, & Bandiera 2001). 

Recently, there has been considerable controversy over the 
faint-end slope of the HiMF. Zwaan et al. (1997) found a 
slope of a = -1.2, consistent with previous estimates based 
on optically selected galaxies (Briggs & Rao 1993). Similar 
values for a were found by Kraan-Korteweg et al. (1999). 
Schneider, Spitzak & Rosenberg (1999) report a flat HiMF 
with an extremely steep low-mass slope below Mm = 10*^. In 
a more recent analysis Rosenberg & Schneider (2002) advo- 
cated a faint-end slope of a = -1 .53. 

The aim of this paper is to calculate the HiMF with higher 
accuracy using the H1R\SS BGC, the largest sample of H I 
selected galaxies available to date. Particular emphasis is 
directed toward understanding possible biases in the calcu- 
lation of the HiMF. In Section 2 we briefly summarize the 
HIR\SS specifics and describe the BGC. In Section 3 vari- 
ous estimators of the HiMF are discussed and a new method, 
the 2-dimensional stepwise maximum likelihood method is 
described. The H I mass function is presented in Section 4, 
which includes a detailed discussion on possible biases that 
may influence the HiMF calculation. In Section 5 the results 
are compared to previous measurements of the HiMF. The 
contribution of different galaxy types is discussed in Section 
6. Section 7 presents a discussion on the H I mass density pni, 
and in Section 8 the selection function of the survey is dis- 
cussed. Finally, in Section 9 the conclusions are presented. 
We use Hq = 75kms~'Mpc~^ throughout this paper. 



2. THE HIPASS BRIGHT GALAXY CATALOG 

The BGC (Koribalski et al. 2003) is a catalog of the 1000 
galaxies with the largest Hi peak flux densities''' in the 
southern sky {5 < 0°). The sample is based on HIPASS, for 
which the observing strategy and reduction details are de- 
scribed in Barnes et al. (2001). Here we briefly summarize 
the HIPASS survey strategy. 

The observations were conducted in the period from 1997 
to 2000 with the Parkes^'' 64-m radio telescope, using the 21- 
cm multibeam receiver (Staveley-Smith et al. 1996). The tele- 
scope scanned along strips of 8° in dechnation and data were 
recorded for thirteen independent beams, each with two po- 
larizations. A total of 1024 channels over a total bandwidth 
of 64 MHz were recorded, resulting in a channel separation of 
Av =13.2 kms"' and a velocity resolution of Si'= 18 kms"' 
after Tukey smoothing. The total velocity coverage is -1200 
to 12700 kms"'. After bandpass calibration, continuum sub- 
traction and gridding into 8° x 8° cubes, the typical rms noise 
is 13 mJy/beam. This leads to a 3a column density limit of 
« 6 X 10' **cm"^ for gas filling the beam. The spatial resolution 
of the gridded data is 15 '.5. 

An automatic galaxy finding algorithm (see Kilborn et al. 
2002) was applied to the HIPASS data set to identify all 
sources with Sp > 60 mJy. The list of potential detections was 
inspected by eye to separate radio frequency interference and 
bandpass ripples from real H I sources. Since the noise in the 
HIPASS data is considerably higher at low Galactic latitude, 
the fist of detections was complemented with \h\ < 3° detec- 
tions from the H i Zone of Avoidance survey (Kenning et al. 
2000). Furthermore, to avoid confusion with the Milky Way 
Galaxy and high velocity clouds, the range |v| < 350 kms"' 
was excluded from the list and substituted with known nearby 
galaxies. From the resulting Ust the 1000 galaxies with the 
highest H I peak fluxes were selected, resulting in a selection 
limit of S'p > 1 16 mJy. Further details of the selection, as well 
as properties of the BGC galaxies, are given in Koribalski et 
al. (2003). An important point to note here is that only four 
BGC sources outside the Zone of Avoidance (b > 5°) have 
no optical counterpart on the Digitized Sky Survey. One is 
obscured by the LMC, one is a tidal cloud of NGC 2442 (Ry- 
der et al. 2001) and the other two have heliocentric velocities 
< 500 kms"' and are probably Magellanic debris (Koribalski 
et al. 2003). 

HIPASS achieves 100% coverage over the southern sky, and 
has a very uniform noise level. However, there are regions of 
the sky near strong sources of radio continuum, where the 
noise level is elevated. Near such sources, it is possible that 
we are missing bright galaxies. As above, the typical rms 
noise level is 13 mJy/beam. However, above an rms noise 
level of 23 mJy/beam, the faintest BGC objects are detected 
with a signal-to-noise ratio of no more than 5 in each velocity 
channel. Within the BGC velocity range, 2.5% of all southern 
HIPASS spectra have an rms noise greater than 23 mJy/beam. 
This gives a useful upper limit to the incompleteness of the 
BGC. 



" In the remainder of this paper we refer to 'peak flux density' as 'peak 
flux'. 

The Parkas telescope is part of the Australia Telescope, which is funded 
by the Commonwealth of Australia for operation as a National Facflity man- 
aged by CSIRO. 
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3. CALCULATING THE HI MASS FUNCTION 

3.1. Methods 

We define the H l mass function 0(Mhi) as the space den- 
sity of objects in units of /i^gMpc"^. The HiMF is normally 
calculated per decade of H I mass, and plotted on a logarith- 
mic scale. For fitting purposes, we use the Schechter function 
(Schechter, 1976) defined by 



9iMii,)dMni = 0*{^r expi-^)dMHu 



Mi, 



(1) 



characterized by the three parameters a, M^j, and 9*, which 
define the slope of the power-law, the H I mass corresponding 
to the "knee", and the normalization, respectively. 

The lA'niax method originally developed by Schmidt (1968) 
to study the evolution of quasars, simply consists of summing 
in H I mass bins the reciprocals of Vmax, the volume corre- 
sponding to the maximum distance Da^j at which an object 
can be detected. This is a popular method for determining 
H I mass functions because the values of Vmax can be readily 
evaluated for every survey if the survey sensitivity is well- 
understood (see Zwaan 1997, Rosenberg & Schneider 2002, 
Kilborn 2000). The method also works well for surveys that 
are not purely flux-limited, but where the survey selection is 
a complicated function of various galaxy parameters and tele- 
scope properties. For the BGC, Z)iim,; can simply be found 
by multiplying the distance D, at which the object is detected 
with {S/S\isa)^^, where Si and S\im are the peak flux of object 
/ and the limiting peak flux of the sample, which is 1 16 mJy. 

The main concern about the Vmax method is that it is po- 
tentially sensitive to the influence of large scale structure. 
The Vmax method intrinsically assumes that the galaxy pop- 
ulation used to evaluate luminosity functions or mass func- 
tions is homogeneously distributed in space. In sensitivity- 
limited samples (as opposed to volume-limited samples), a 
correlation exists between the mass or luminosity of objects 
and the distance at which they are preferentially found. Un- 
derdense or overdense regions may therefore yield under or 
over-representation of objects of corresponding mass or lumi- 
nosity. Maximum likelihood techniques (Sandage, Tamman 
& Yahil 1979 [STY] and the stepwise maximum likelihood 
method [SWML] Efstathiou, Ellis & Peterson 1988) ai-e de- 
signed to be insensitive to density fluctuations and it is there- 
fore important to test whether these methods can be employed 
for our H I selected galaxy sample. 

The key of the maximum likelihood techniques is to find the 
parent distribution 9 which yields the maximum joint proba- 
bility of detecting all objects in the sample. The probability 
that a galaxy with H l mass Mhi./ be detected can be expressed 
as 

p(Mmj \Di) = — , , (2) 



MHI.Iim(0/) 



9(Mni)dMm 



where MHi.iim(A) the minimal detectable H I mass at dis- 
tance Di in Mpc. Put differently, p(Mui,i\Di) is the fraction of 
galaxies in the survey volume with H I mass Mhi./ that are near 
enough to be brighter than the survey detection limit. Finding 
for which 6'(Mhi) the product of the probabihties. 



(3) 



is maximal, then gives the maximum likelihood solution for 
the mass function. The disadvantage of this procedure is that 



it is parametric, that is, it requires an analytical expression for 
9, which is usually taken to be the Schechter function, as in 
EqH The stepwise maximum likelihood method (SWML) is 
a modification of this procedure, and measures 9 at fixed inter- 
vals of log Mhi by iteration. This procedure does not depend 
on a functional form for 9. 

As described above the maximum likelihood estimators re- 
quire the calculation of MHi,iiin(£'!), the minimal detectable 
Hi mass at distance D,. For a hypothetical sample limited 
by integrated H I flux, this parameter could be simply calcu- 
lated by MHi,iiin(£>) = 2.36 X lO^D^ / SdVy,^, where / SdVu,^ is 
the integrated flux limit in Jy kms"' and D is the distance in 
Mpc. For optical redshift surveys the analogous value of min- 
imal detectable luminosity Liin,(Z)) can be determined since 
optical redshift surveys are generally flux-limited, and maxi- 
mum likelihood methods can be readily applied. H I selected 
samples are seldom integrated flux-limited, but instead the de- 
tectability of signals depends on both the velocity width AV 
and peak flux Sp (Zwaan et al. 1997, Rosenberg & Schneider 
2000), or in the case of the BGC, solely on Sp. Therefore, a 
unique relation between distance and the minimal detectable 
H I mass does not exist, and the standard maximum likelihood 
techniques can not be employed. 

In principle, for every galaxy in the sample a parameter 
MHi,iim(£'!,^'/) can be calculated, which is the minimal de- 
tectable H I mass at distance D,-, with profile shape P,- equal 
to that of galaxy /. Rosenberg & Schneider (2002) apply 
this method to their ADBS sample and find an HiMF with 
faint-end slope a = -1.53. However, this method can lead 
to incorrect measurements of the HiMF. The SWML essen- 
tially consists of determining for each bin k in logMni the 
volume accessible to that H I mass, by summing the recipro- 
cals of space densities of galaxies that could be detected in 
the volume out to the maximum distance at which a galaxy in 
bin k could be detected. A summation over all galaxies with 
Mhi < Muifim(Di,Pi) is not the same. 

In the following we briefly demonstrate that this implemen- 
tation of the maximum likelihood technique for the BGC can 
result in a severe overestimation of the faint-end slope a. We 
fill volumes equal to that of the BGC with synthetic galaxy 
samples using HiMFs with slopes a = -1.0 and a = -1.4. 
The galaxies follow a general trend of AV oc Mhi"^^, simi- 
lar to what is observed in the BGC, and we introduce scatter 
on all galaxy properties equal to that seen in the BGC. Next 
we select from these volumes all galaxies with 5p > 1 16 mJy, 
and calculate the HiMF following the SWML method as de- 
scribed above. From these synthetic peak flux selected sam- 
ples we also select integrated flux-limited subsamples, for 
which Sint > 25 Jykms"', and also apply the SWML method 
to these samples. We test by means of a V/Vmax test that the 
cutoff at Sint > 25 Jykms"' provides a statistically complete 
subset. Finally, the l/Vmax method is applied to both the peak 
flux-limited samples and the integrated flux-limited subsam- 
ples. All simulations are performed 100 times. 

The results are presented in Figure Q The top two panels 
represent the results from the SWML method and the bottom 
two panels those from the lA'max method. In each panel the 
solid lines show the input Schechter function, the circles the 
measured HiMFs for the peak flux limited samples and the 
open squares the measured HiMF for the integrated flux lim- 
ited subsamples. The dashed fine is a Schechter function fit to 
the points. 
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Fig. 1. — Testing the conventional SWML metliod on peak flux limited 
samples. In each panel the sohd line shows the input Schechter function that 
is used to constract synthetic galaxy samples, the filled circles represent the 
output H I mass functions of the total sample and the squares show that of an 
integrated flux limited subsample. In the top two panels the SWML method is 
applied, in the bottom two the I/Vmax method is used. The SWML application 
on a peak flux limited sample clearly gives incorrect results. 

Both methods recover the input HiMF very well if they are 
applied to the integrated flux limited subsamples. However, 
this figure illustrates clearly that the SWML method results in 
a severe overestimation of the faint-end slope if it is applied 
to the peak flux limited samples. 

We note that the examples presented here are specific to the 
BGC. To which degree the output HiMF is in error depends 
very strongly on the selection function of the H I survey and 
there are situations possible in which the SWML coinciden- 
tally gives the correct answer. 

3.2. The 2-dimensional SWML method (2DSWML) 

On galaxy samples that are other than flux-limited, other 
parameters have to be included in the maximum likelihood 
calculation of the space density of objects. For a given H I 
source at distance D, the peak flux is directly proportional to 
its H I mass Mm, and inversely proportional to the velocity 
width over which the flux is distributed. Therefore, the veloc- 
ity width is the second parameter we have to include in the 
maximum likelihood analysis. A 2-dimensional maximum 
likelihood algorithm can then be employed to find the true 
galaxy distribution. In the following we use W20, the profile 
width measured at 20% of peak intensity, as a measurement 
of the velocity width. 

The probability of detecting galaxy / with H I mass MJjj and 
velocity width at distance D, is given by 



/7(M{ji,w^olA) = 



Ju-'20=0 JI\ 



where 



Ml 



9(Mm , W2o)dMmdw20 



i Slim W20 



Hl.lim 



iW20)=M'm— 

Si w\ 



(4) 



(5) 



is the minimal H I mass the galaxy could have to be detectable 
at distance D,- if it had a velocity width W20, and 6'(Mhi,W2o) 
is the 2-dimensional density distribution function. However, 
we have no a priori knowledge of the functional form of 
6'(Mhi, W20), which implies that a parametric maximum likeli- 
hood method (STY) can not be applied. 

Alternatively, we can define the logarithmically binned 2- 
dimensional density distribution function 



where 



ejk = e{M,w), 



j=l,...,NM and fc= l,...,A^w 



(6) 



(7) 



and Nm and are the number of bins in M and W , respec- 
tively, and we define 



M = log(MHi) and W = log(w2o)- 



(8) 



The logarithm of the likelihood of detecting all galaxies in 
the sample can now be expressed as 

f^e Nm A'w 

In £ = ^ ^ ^ y (M,- Mj ,Wi-Wk)\n 0jk 
i=i j=i k=\ 



-E 

1=1 



In 



Nm A'w 
j=\ k=l 



where c is a constant and V is a function defined by 

V(x,y): 



l,\x\< AM/2 and \y\ < AW/2 
0, otherwise 



(9) 



(10) 



which makes the first sum only go over galaxies in bin 
AM Aw. The function Hjjit is defined as the fraction of the 
bin accessible to source /: 



Hijk = 



1 



AMAW 



W=Wt-AW/2 JM=M 



Mi+AM/2 



dMdW, (11) 



where 

M" = maxiMj - AM/2 , min(M; + AM/2 , Mu^.d) , 

Miim.i = Mi + log{Sun/Si) -Wi + Wk, 
W^ = miniWk + AW/2,ma\iWk - AW/2, Wi™,,)), 
= Wi-logiSUSi)-Mi+Mj. 

This implies that 

Hijk = lif Mi-Mj-Wi + Wk + 

log(Sii,^Si) + AW/2 + AM/2 < 

Hijk =0 if Mi-Mj-Wi + Wk + 

log(5iin/5,)- AW/2 -AM/2 > 0. 

Maximum likelihood solutions for 9jk are now found by dif- 
ferentiating C and applying the usual additional constraint to 
fix the normalization (see e.g., Efstathiou et al. 1988). We 
then arrive at 



Ne 



Y^V(Mi-Mj,Wi-Wk) 



"g / Nm A'w \ 

Y^HijkAMAW Y,Y.^,,nHii,„AMAW 



r(l2) 



20 



, 1=1 m=l 
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and most likely values for djit are found by iterating Eq. 
Stable solutions for 9jk are usually found after ~ 25 iterations. 
Finally, the HiMF can be calculated from 



Nw 



eiMj) = Y,mj,Wk). 



(13) 



k=l 



In the remainder of this paper we refer to this method as 
the 2-dimensional stepwise maximum likelihood method or 
2DSWML. A very similar technique was used by Loveday 
(2000) to calculate the A'-band luminosity function from a 
Z?-band selected galaxy sample. The implementation of the 
2DS WML method described in this paper is designed to work 
on the peak flux limited BGC, but the method could be ad- 
justed to work on samples other than peak flux limited. This 
will be of particular interest for the full sensitivity HIPASS 
galaxy catalog (Meyer et al. 2003a) for which the selection 
criteria are a combination of peak flux and velocity width. 

3.3. The selection function 

Whereas for the Vmax method the normalization of the mass 
function is automatically determined, for the maximum like- 
lihood methods the normalization has to be determined after- 
wards. First we have to evaluate the selection function S(D), 
which describes the probability that an object at distance D is 
detected by the survey. For a flux-limited sample the selection 
function would normally be calculated with 



J maxO 



S(D) = 



max(Miim(D),Mi„„) 



e(M)dM 



(14) 



where Mhigh and Mi^w are the highest and lowest values of 
logMei in the sample. For our 2-dimensional distribution 
function this would translate to 



5(D) = 



M, 



high 



max(Miim(D,H'),Mi„w) 



e(M,W)dMdW 



rMhigh 



eiM,W)dMdW 



(15) 



where Whigh and Wiow are the highest and lowest values of 
logW2o in the sample and M\i^{D,W) is the minimal de- 
tectable H I mass of a galaxy with log profile width W and 
at distance D Mpc. The problem is that we can not derive an 
analytical expression for Miim(D, W). It would be possible to 
derive an empirical relation for Miin,(D, W), but it is far better 
to simply make use of the actual data. Therefore, we evaluate 
the selection function in Eq.[21for each galaxy individually 
and then average the values of S{Di) in bins of distance. 

The mean galaxy density n is then determined by correct- 
ing the measured distance distribution of objects by the se- 
lection function. There are various methods described in the 
literature of determining n (Davis & Huchra 1982, Willmer 
1997). Here we choose to use the minimum-variance estima- 
tor (Davis & Huchra 1982), but tests with different estimators 
gave very similar results. In the calculation of ft, the selec- 
tion function is weighted with the inverse of the second mo- 
ment of the two-point correlation function, which we set to 
73 = 8000/!7|Mpc3 (Meyer et al. 2003b). The value of n is 
found to be very insensitive to the exact value of Jj. Since at 
very low and high distances the number of galaxies is small, 
the selection function is not accurately known in those re- 
gions. We therefore choose to limit the calculation of il to 
0.001 < S(D) < 0.1, which roughly corresponds to the dis- 
tance range 10 < D < 50 Mpc. Limiting the calculation to this 
distance range also ensures that the overdense volume within 



two correlation lengths around the Milky Way Galaxy is not 
taken into account in the calculation of n. Finally, we normal- 
ize the H I mass function by setting 



Whigh /-Mhigh 



e(M,W)dMdW = n. 



(16) 



3.4. Testing the 2DSWML method 

Before applying the 2-dimensional SWML method to the 
BGC, we run a number of simulations to test whether the 
method produces reliable results. Similarly to the simulations 
in Section l3 3.1l we fill volumes with synthetic galaxy samples 
and select sources from the volume in the same manner as the 
BGC is selected. The galaxy samples are constructed such 
that the distribution and the correlation statistics of peak flux, 
velocity width and H I mass are similar to those of the BGC 
and the samples typically contain ^ 1000 galaxies. Each test 
is based on 100 simulations. 
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Fig. 2. — Testing the 2DSWML method. The left panels show the input 
HiMFs represented by solid lines and recovered HiMFs by open symbols 
and errorbars. In the light panels the thin-lined histograms show the redshift 
distribution of the simulated data and the solid lines are the reconstructed 
selection functions multiplied by flD'ADn. The thin dashed hnes represent 
1(7 uncertainties on the selection functions. 

Figure 12] shows the results of three different tests. In the 
left panels the input HiMFs are represented by solid lines, 
whereas the measured HiMFs are shown by open symbols 
and errorbars. In the right panels the thin-lined histograms 
show the redshift distribution of the simulated data and the 
solid lines are the reconstructed selection functions multiplied 
by ilD^ADii. The thin dashed lines represent la uncertain- 
ties on the selection functions. From top to bottom we show 
simulated data based on 1) a faint-end slope a = -1.1 and an 
overdensity at ~ 30 Mpc, 2) a = -1.5 and an underdensity at 
~ 20 Mpc, and 3) a = -1 .3 and an underdensity at ~ 30 Mpc 
and an overdensity at '--^ 10 Mpc. 

It is clear that the 2DSWML method is capable of recover- 
ing the input HiMF with high accuracy, independent of strong 
over- and underdensities in the redshift distribution of galax- 
ies. Also our implementation of the calculation of the selec- 
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tion function gives satisfactory results. The selection function 
corresponds well with the overall redshift distribution and is 
insensitive to strong density variations. Motivated by the suc- 
cess of the 2DSWML method on synthetic data, we choose to 
apply it to BGC. The results are described in the next section. 

4. RESULTS 

A greyscale representation of the density distribution func- 
tion 9jk is shown in Figure |3l This function is calculated 
with bin sizes AM = 0.2 and AW = 0.1. This choice of bin 
sizes results in a 24 x 15 grid containing 360 bins dMdW, of 
which 144 contain galaxies. The mean occupancy of the bins 
is therefore 7 galaxies per bin. In Figure |3l we also plot the 
data points on which the analysis is based. It is clear that the 
low mass bins contain very few galaxies, which explains the 
noisy character of the lower left part of the figure. Not sur- 
prisingly, there is a strong correlation between the H I mass of 
a galaxy and its observed velocity width. Note that the plot- 
ted velocity width is the observed width of the H I signals, 
uncorrected for galaxy inclination. 
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Fig. 3. — Bivariate galaxy density distribution in tlie (Mhi,»'2o) plane, 
derived via the 2DSWML method. The greyscales are on a logarithmic scale 
and represent the space density per decade of Mm and decade of W2o- The 
distribution of data points is shown by black dots. 

A comparison between the greyscales and the black points 
illustrates that a peak flux limited sample is, at a particular 
H I mass, slightly biased toward galaxies with low velocity 
widths, or alternatively, at a particular velocity width, slightly 
biased toward galaxies with high H I masses. Although the 
survey contains a large fraction of high H I mass, low velocity 
width galaxies, the true space density of these objects is very 
low. 

The Hi mass function can now be found by applying 
Eq. ^] to the 2-dimensional density distribution Ojf.. The 
result is shown in Figure |3 where the dots show the mea- 
sured space density of objects per decade of Hi mass and 
the solid line is the best fit Schechter function with param- 
eters a = -1 .30 ± 0.08, log(Mi^/M0) = 9.79 ± 0.06, and d* = 
(8.6 ±2.1) X lO'^Mpc""*. The Schechter function provides an 



excellent fit to the data. In fact, thanks to the large sample 
size and hence small Poisson errors, we convincingly show 
here for the first time that the H I mass function of galaxies 
can be satisfactorily described by a Schechter function. Note, 
however, that the shape and the normalization of the HiMF 
are determined without using a Schechter function as an as- 
sumption about the intrinsic shape of the HiMF. 
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Fig. 4. — The BGC Hi mass function. Bottom panel: Distribution of 
H I masses in the BGC per bin of 0.2 dex. Top panel: Points and eiTorbars 
show the space density of objects derived with the 2DSWML method. The 
solid Hne is a best-fit Schechter function which has been determined by 
minimalization. The best fit Schechter parameters are also shown. 

The uncertainties on the Schechter parameters are deter- 
mined with the jackknife error estimator (Lupton 1993). This 
is simply done by dividing the sample into 24 equal regions 
of sky and calculating the H I mass function 24 times, leaving 
out a different region each time. A Schechter function is fit- 
ted to each mass function and the errors on the parameters are 
found by 

al = ^^,{x-x)\ (17) 

where = 24. The quoted uncertainties therefore include sta- 
tistical errors and incorporate the effects of large scale struc- 
ture, but they do not include systematic errors. 

As is normally the case in these fits, the errors in a and 
logMni are correlated in the sense that steeper faint-end 
slopes imply higher values of logMni. The correlation co- 
efficient between a and logMei is r = -0.28, but due to the 
small number of jackknife samples (A^ = 24), the error on r is 
large. 

The 2-dimensional distribution function 6jk can also be in- 
tegrated along lines of constant velocity width to obtain the 
'velocity width function', the space density of objects as a 
function of velocity width, or, after inclination correction, as a 
function of rotational velocity. This analysis will be the topic 
of a forthcoming paper. 

4.1. Biases in the calculation of the HlMF 

In the following we investigate the influence of various se- 
lection effects on the shape of the HiMF and on the measure- 
ment of the integral H I mass density. The effects of the biases 
are summarized in Table [l] 
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4.1.1. Noise on H I detection spectra 

The BGC consist of the brightest galaxies in the total 
HIPASS galaxy sample and therefore the signal to noise of 
the detections is high. The average rms noise in the HIPASS 
data is 13 mJy/beam per 18.0 km/s, which means that with 
the BGC detection limit of 116 mJy the lowest possible S/N 
level would be 9. In reality, because the detections are spread 
out over more than one resolution element, the S/N ratios are 
generally higher. However, since most detections are much 
wider than one resolution element, the measured peak flux is 
often an overestimation of the galaxy's noise-free peak flux. 
This becomes increasingly important for flat-topped profiles 
in which a large fraction of the channels are close to the peak 
flux. 

We test this selection bias by adding Gaussian noise with a 
rms dispersion of 1 3 mJy to the H I profiles of synthetic galaxy 
samples which resemble the BGC, and selecting galaxies by 
their measured peak flux. Next we calculate HiMFs from 
these samples using the 2DSWML method. Two extreme sit- 
uations are tested, one in which all profiles are Gaussian, and 
one in which all profiles are double-horned. Figure |5l shows 
the true HiMF of the simulated galaxies as a solid line, the re- 
covered HiMF for double-horned profiles as open circles and 
the recovered HiMF for Gaussian profiles as crosses. 
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Fig. 5. — The effect of noise on the detection spectra on the HiMF de- 
termination. The solid Hne shows the input HiMF with a low mass slope of 
a = — 1.3 that has been used to create synthetic galaxy samples. Noise with 
an rms fluctuation of 13 mJy has been added to the synthetic galaxy spectra. 
The points show the recovered 2DSWML HiMF assuming Gaussian profiles 
(crosses) and double-horned profiles (circles). 

For both cases the selection bias causes a small global over- 
estimation of the HiMF. For the Gaussian profiles, a is overes- 
timated by Aa « 0.03, whereas for the double-horned profiles 
the change in a is negligible. For the BGC, which is domi- 
nated by galaxies showing double-horned profiles, the integral 
H I mass density pei is probably overestimated by less than 
15%. 

4.1.2. The Eddington effect 

Another possible source of error in the H I mass function is 
the so-called 'Eddington effect'. This effect causes a steepen- 
ing of the low mass slope due to distance uncertainties. For 
the vast majority of the BGC galaxies distances are calculated 
from their recessional velocities and peculiar velocities can 
introduce errors in these distance estimates and hence in the 
H I masses. In a relatively nearby sample such as the BGC 



the Eddington effect is potentially much more important than 
for deeper surveys like the AHISS survey, for which the peak 
of the galaxy distance distribution is much higher and relative 
distance uncertainties are smaller 

To test the severeness of the Eddington effect, we add Gaus- 
sian noise to the recessional velocities in our synthetic galaxy 
samples and select galaxies from the samples similar to the se- 
lection of the BGC. The results are summarized in Figure |6l 
The solid line shows the input HiMF and the symbols show 
the recovered HiMFs for samples for which dispersions of 
50 (squares), 100 (triangles), 200 (crosses) and 400 (circles) 
kms"' are added. Each simulation is performed 100 times. 
The 2DSWML method has been used to calculate the HlMFs. 

Figure |6l shows that the Eddington effect is a potentially 
important bias in our analysis. If a velocity dispersion of 
200 kms"' is added to our synthetic galaxy samples, the mea- 
sured HlMF slope increases from a = -1.30 to a = -1.43. 
For a velocity dispersion of 100 kms"' a rises to -1.36, for 
50 kms"' the change in a is no longer measurable. As ex- 
pected, the Eddington effect has almost no influence on the 
measured space density of sources around Mm = Mj^j, there- 
fore the effect on the integral H I mass density pm is small. 
The inset in Figure |6l shows the recovered selection function 
S(D) for each of the simulations, compared to the selection 
function of a galaxy sample without added noise to the dis- 
tances (dashed line). The selection functions have been mul- 
tiplied with and are normalized to a peak value equal to 
unity. The x-axis is logarithmic to better show the effects on 
the selection function at small distances. This clearly illus- 
trates that the Eddington effect artificially steepens the selec- 
tion function at small distances. 
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Fig. 6. — The Eddington effect. The soHd Hne shows the HiMF of sim- 
ulated data sets, with a low mass slope of a = -1.3. The symbols show the 
recovered 2DSWML HiMFs after Gaussian noise is added to the recessional 
velocities of the galaxies. Squares are for a dispersion of 50 kms"', triangles 
for 100 kms"', crosses for 200 kms"', and circles for 400 kms"'. 

Our model of adding Gaussian noise to the recessional ve- 
locities of galaxies oversimplifies the true peculiar velocity 
distribution. The real dispersion is a function of local density 
(Strauss, Ostriker & Cen 1998) and should be directional to 
some degree, instead of random. However, Sheth & Diafe- 
rio (2001) recently showed that the expected peculiar veloc- 
ity distribution in CDM simulations is nearly Gaussian. The 
1-D pairwise peculiar velocity dispersion an is found to be 
« 300 kms"' on scales of l/iJoo Mpc (e.g. Jing, Borner & Suto 
2002) and lower on smaller scales. This value compares to a 
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velocity noise of cr = 2~^^ai2 = 212 kms~'. Clusters of galax- 
ies contribute significantly to this high value and the velocity 
field outside of clusters, where late-type galaxies dominate 
the statistics, is known to be much colder (e.g. Strauss et al. 
1998). For example, WilHck et al. (1997) find a = 125 kms"' 
for local (cz < 3000 km s~') spiral galaxies. More locally, the 
velocity field is even colder than this: the peculiar velocity 
dispersion of galaxies within 5 Mpc of the Milky Way Galaxy 
is only 60 kms"' (Schlegel et al. 1994). Considering the latter 
two values, our simulations with dispersions of 50kms"' and 
lOOkms"' probably define the boundaries of realistic test of 
the Eddington effect. From Figure|6lwe conclude that our esti- 
mated HiMF probably overestimates the steepness of the faint 
end slope a with Aa < 0.05. Over the range of Hi masses 
where we can reliably measure the HiMF, the H I mass density 
Phi is maximally overestimated by 8%. 

Our results are different from Rosenberg & Schneider 
(2002) who find that for their sample the Eddington effect is 
unimportant even for a dispersion of 600 kms"'. This dif- 
ference can be explained by the difference in survey depth 
between the BGC and the ADBS. In the analysis of the deep 
HIPASS catalog, the Eddington effect will be less important. 

Of course, the H I mass estimates of the nearest galaxies 
suffer the most from the distance uncertainties. A solution 
would be to impose a lower distance limit to the galaxy sam- 
ple above which the distance uncertainties are believed to be 
small. However, for samples other than integrated flux lim- 
ited, imposing a lower distance limit causes a slight drop of 
the HiMF at low H l masses. This happens because in the low 
H I mass bins only those galaxies will be selected that have 
a high peak flux compared to other galaxies of the same H I 
mass at the same distance. 

4.1.3. Inclination effects 

The inclination of galaxies might lead to two indepen- 
dent biases in the HiMF determination. One eff ect, H I self - 
absorption, is discussed in the next subsection (144.1 4.1.4> . 
Here we investigate the effect that in the galaxy selection 
procedure a minimal velocity width w,^in is applied, below 
which galaxy signals can not be reliably distinguished from 
radio frequency interference (RFI). Since galaxies with low 
inclinations have smaller velocity widths, this selection effect 
might lead to a underrepresentation of face-on galaxies. Via 
the Tully-Fisher relation (Tully & Fisher 1977) and the rela- 
tion between H I mass and optical luminosity (e.g., Roberts & 
Haynes 1994), it is known that galaxies with low Hi masses 
have lower velocity widths. The selection bias therefore be- 
comes progressively more important for dwarf galaxies, pos- 
sibly causing a flattening of the H I mass function. Lang et 
al. (2002) were the first to calculate the severeness of this ef- 
fect and concluded that for their sample of galaxies the HiMF 
could be underestimated by 18.7% at Mm = 2 x lO^M©. Here 
we make specific calculations for the BGC. 

The velocity width measured at 50% of the peak flux, wgo, 
can be written as 

W50 = (Wo Sin^ / + wl^f^ + Winst , (18) 

where wo is the intrinsic velocity spread due to rotation, Wtur is 
the velocity width due to turbulence in the gas layer, and Winst 
is the contribution of instrumental broadening to the velocity 
width. For Wtur we adopt the conservative value of 6 kms"' 
from Verheijen & Sancisi (2001) and for vvinst we use the Bot- 
tinelli et al. (1990) estimate of Winst = 0.13 x 6v, where 6v is 



the velocity resolution. For our case, where Sv = IE kms"', 
we find that winst = 2.3 kms"'. 

For detections to be included in the BGC, signal must be 
found in at least two consecutive channels. The BGC is lim- 
ited to sources with Sp > 116 mJy, but the original search 
limit was approximately 50% lower, at 60 mJy (see Koribalski 
et al. 2003). We therefore assume that the minimal velocity 
width for inclusion in the BGC, w^in, applies to the 50% level 
and hence W50 > Wmin = 26.4 kms"'. Following Lang et al. 
(2002), we can calculate from Eq.^]the minimal inclination 
angle /mm that a galaxy needs to have for it to be included in 
the sample: 

/m,n = arcsin{ ^^-"^t^'"^'- }^l (19) 
^0 

Using the values given above we find that /min = 13.5° for wq = 
100 kms"' and /min = 6.7° for wq = 200 kms"'. Recall that 
vv() is the total velocity width of a galaxy, which is twice the 
rotational velocity. 

From this we can estimate (, the fraction of galaxies poten- 
tially missed at every wq, by integrating from / = to / = /min 
over a randomly oriented sample: 

C= / Sin/c//=l— cos /min 

■ 

^ _ I ^ _ (Wmin - Winst)^ " t^mr 1 1/2 ^20) 

wl 

We find that the values for ( are very small: ^ = 2.7% (0.7%) 
for wo = 100 kms"' (200 kms"'). We note that our adopted 
values for Wmst and Wtur are very conservative, and much 
higher values can be found in the literature. With higher 
values of Winst and Wtm- the resulting ( would become even 
smaller 
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Fig. 7. — The effect of minimal velocity width on the HiMF measure- 
ment. The input HiMF is drawn as a solid line. The 2DSWML HiMF that 
is measured if face-on galaxies are missed due to the minimal velocity width 
requirement, is shown as a dotted line. The dashed line is the same, but for 
Wtur = iVinst = 0. The inset shows 0(Mhi) on a linear scale to better show the 
differences between the lines. 

Finally, in order to express <^ as a function of Hi mass, 
we need to adopt a relation between Mm and wq. Lang et 
al. (2002) adopt the relation wq = 0.42Mhi°^, where in their 
case Wq is derived from W2o- Here we use wso and find that 
Wo = 0.35Mhi" -' is a better fit to the BGC data. We note, how- 
ever, that the scatter in the correlation between Mm and wo is 
very large. An expression of as a function of Mm should 
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therefore be regarded as illustrative. If we boldly apply the 
relation, we find that C = 10% (5%) for Mm = 2 x 10^ M© 
(5 X 10^ Mq). 

In Figure we show the effect of the inclination bias. The 
soHd Hne is an HiMF with a = -1 .30 and logMj^j = 9.80, and 
the dotted line is what would have been measured if galaxies 
with low inclination angles were missed in the survey. The 
dashed line is the same, but here we test the extreme case that 
Wtui and Winst are negligible. The inset shows 0(Mhi) on a 
linear scale to better show the differences between the lines. 
The inclination bias only becomes apparent at very low H I 
masses, implying that only the last two bins of our measured 
HiMF are slightly affected by the bias. Over the H I mass 
range where the HiMF is measured, the mass density pm is 
only underestimated by approximately 1%. 

4.1.4. Hi self-absorption 

In the Milky Way Galaxy the cold neutral phase of the in- 
terstellar medium can be seen as H I self-absorption (HISA) 
against the Hi background (e.g., Knapp 1974, Gibson et al. 
2000). It is not well known to what extent this HISA reduces 
the 21cm emission line flux of external galaxies. Deep, high 
resolution 21cm maps of nearby spirals (Braun 1997) show 
that between 60 and 90% of the H I flux comes from of a net- 
work of cool gas (T < 300 K), which is expected to have a 
high opacity. However, not may instances of H I absorption 
against background continuum sources is observed in these 
regions. Dickey et al. (2000) find that in the Small Mag- 
ellanic Cloud the abundance of cool-phase gas is only 15%, 
a factor of two lower than what is found in the Milky Way 
Galaxy. The temperature of this gas is very low, typically 
40 K or less. From detailed absorption studies they calcu- 
late the HISA correction as a function of column density, and 
find that the correction is negligible for A^hi < 3 x 10^' cm"^, 
and rises to 1.4 at A^hi = 10^^ cm~^. These low correction val- 
ues agree with the analysis of Zwaan, Verheijen and Briggs 
(1999), who find that the column density distribution function 
of galaxies follows A^hi"^, expected for optically thin gas, up 
toA^Hi = 8 X 10^' cm-l 

Haynes & Giovanelli (1984) plotted the H I mass of galax- 
ies as a function of inclination and measured /hi, the ratio of 
HISA corrected to measured flux, for different morphological 
types. Zwaan et al. (1997) considered the effect of HISA on 
the HiMF, and based on Haynes & Giovanelli's (1984) result 
they found that the average value of /hi for a randomly ori- 
ented galaxy sample is approximately 1.10. They concluded 
that M^j and 9* would increase by no more than 10% if HISA 
effects were taken into account. The integral density pui could 
be underestimated by ~ 15%, maximally. Lang et al. (2002) 
addressed the issue of HISA by plotting the distribution of in- 
clination angles of their HIJASS galaxies, and found that the 
number of highly inclined galaxies is lower than what is ex- 
pected for a randomly oriented sample. From this they derive 
that /hi is 1.25 and that pui could be underestimated by as 
much as 25%. 

In FigureOthe distribution of the cosine of inclination angle 
/ of cataloged BGC galaxies is drawn as a solid line (see Jerjen 
et al. 2003 for optical properties of the BGC sample). For a 
randomly oriented sample the distribution should be flat. The 
BGC distribution is clearly under-abundant at low and high 
inclination, suggesting the effects of HIS A at high / a nd the 
minimal velocity width effect (see Section 144.1 4.1.31 at low 
/. However, also shown in Figure|8]is the cos / distribution for 
the 1000 brightest southern optically selected galaxies from 



LEDA (dashed line), and the 1000 brightest southern opti- 
cally selected galaxies with available 21cm data (dotted line). 
All distributions are indistinguishable. This implies that the 
distribution of cos / is not determined by HISA and minimal 
velocity width, but by the inclination measurements available 
from the literature. As was discussed by Huizinga & van Al- 
bada (1992), purely circular isophotes are seldom observed in 
galaxies, leading to an apparent underrepresentation of face- 
on disks. On the other end of the distribution, three different 
effect cause a deficit of galaxies. Firstly, due to dust obscura- 
tion, highly inclined galaxies drop out of the optical sample. 
These galaxies are also likely to be missed when our H I se- 
lected sample is cross-correlated with optical catalogs. Sec- 
ondly, the intrinsic thickness of galaxies causes uncertainties 
in /, especially for highly inclined galaxies. This effect causes 
the spike at cos / = and may cause a deficit at low /. Thirdly, 
HISA might cause a underrepresentation of high / galaxies. 
However, since the optical information on the BGC is incom- 
plete, it is not possible to disentangle the effects of HISA and 
optical dust extinction. We therefore conclude that based on 
the information at hand, no meaningful measurement of the 
effects of HISA can be made. In absence of reliable intrin- 
sic measurements, we adopt the externally measured value 
of 15% underestimation of fini, derived from the results of 
Zwaan etal. (1997). 




Fig. 8. — Distribution of cos inclination angle. The solid line is the dis- 
tribution for BGC galaxies with optical information. The dashed line is for 
the 1000 brightest southern optically selected galaxies from LEDA, and the 
dotted line is for the 1000 brightest southern optically selected galaxies with 
available 21cm data. 



4.1.5. Confusion 

The relatively coarse spatial resolution of the HIPASS sur- 
vey (15. '5) causes confusion of some signals with neighboring 
galaxies. Via ATCA follow-up observations and comparison 
with literature values, Koribalski et al. (2003) identified in 
the BGC 1 1 compact groups and 44 pairs of galaxies whose 
signals might chance to coincide in frequency. Furthermore, 
there are 67 sources that are flagged as 'confused'. 

It is expected that some of the confused sources would not 
make it into the BGC if they were resolved because their peak 
flux would drop below the selection limit. In that case we 
could overestimate the normalization of the HiMF as artifi- 
cial sources would enter the sample. On the other hand, if 
the unconfused sources had peak fluxes above the selection 
limit, galaxies would shift to higher mass bins, leading to an 



overestimation of M, 
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We test the two extremes by splitting all sources marked as 
'confused' or 'pair' into two sources of equal H I mass, and 
distinguish two cases: 1) the peak flux of the new sources is 
equal to that of their parent source, but their velocity width is 
halved, 2) the velocity width of the new sources is equal to 
that of their parent source, but the peak flux is halved. In this 
case only the new sources that have Sp> 116 mJy are counted. 
For case 1) we find that M^j drops by 15%, 9* increases 
by ^ 15%, and a does not change significantly. Since the 
changes in 6* and M^j are balanced, there is no nett effect on 
the H I mass density pni- In case 2) the changes in 9* and M^j 
are comparable, but additionally there is a marginal decrease 
in Of of w 0.02. As in this case some sources drop below the 
detection limit, pm drops slightly with « 2%. 

4.1.6. Cosmic variance 

In this section we investigate to what extent the shape of the 
HiMF depends on the region of the sky that is investigated. 
Previous blind H I surveys were based on relatively small re- 
gions of sky (66 degrees^ for the AHISS and 430 degrees^ 
for the ADBS), whereas the present analysis is based on a 
survey covering 2x10^ degrees^. This large area would be 
expected to guarantee a fair sampling of the local volume, but 
on the other hand, the BGC is a shallow survey compared to 
the AHISS and the ADBS. 

Figure |9l shows 2DSWML HiMFs for the four different 
quadrants of the southern sky. Around Mhi = Mj^j, the vari- 
ation in the HiMFs is only mild, but at 10^ M0 the estimated 
space density varies with a factor 5, with the first quadrant 
being the most deviant one. Not only is the estimated aver- 
age density lowest there, but the low mass end is also flatter. 
The cosmic variance is also reflected in the number of sources 
in each quadrant. The third quadrant contains 313 sources, 
whereas the fourth quadrant contains only 159 sources. 
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Fig. 9. — The effect of cosmic variance on the HiMF shape. The different 
symbols correspond to HiMF determinations in four different quadrants of 
the southern sky. 



This exercise shows that selecting small regions of sky for 
galaxy surveys can introduce substantial uncertainties in the 
estimated space density of galaxies. We stress, however, that 
several properties of a survey define its immunity against large 
scale structure: the sensitivity, the covered area of sky, and 
the shape of the covered area. The former two properties to- 
gether define the search volume of the survey. Surveys like the 
AHISS and the ADBS cover small declination ranges, but 24*' 
in RA, whereas the HIPASS SCC survey covers more square 



degrees but is concentrated only on the South celestial cap. In 
addition, the immunity against large scale structure depends 
on whether the search volume extends well beyond the local 
overdensity surrounding the Milky Way Galaxy. 

To quantify these statements and test the sensitivity to large 
scale structure of blind H I surveys, we compute for each 
survey the number of lO^Mpc-* and 20^Mpc-' cells that are 
probed for galaxies with different H I masses. The sizes of 10 
and 20 Mpc are chosen because they span the typical sizes of 
voids in the local universe (Hoyle & Vogeley 2002; Plionis & 
Basilakos 2002). For a survey to be insensitive to large scale 
structure it seems reasonable to require that it probes several 
of such cells. Note that in the calculation of the number of 
cells we do not require that all cells are fully probed, but in- 
stead we calculate the number of cells from which each survey 
selects its detections. The results are given in Table|2l which 
also shows the search volumes for different H I masses for all 
surveys. 

An important conclusion is that all surveys, except for the 
strip surveys AHISS and ADBS, select their Mhi = 10^ M© 
sources from only one 20^^Mpc^ cell, and at most a few 
lO-'Mpc-' cells. At Mhi = 10^ Mq most surveys cover a large 
number of cells, although the shallow, large scale samples se- 
lected from HIPASS still probe fewer cells than the other sur- 
veys. At Mhi = 10'" M0 all surveys are probably insensitive 
to large scale structure effects. Since these are the H I masses 
that dominate the total H I mass density (see SectionQ, mea- 
surements thereof based on HIPASS data are very secure. 

Of course, this analysis may be too complementary to the 
strip surveys. Although these surveys go through many more 
cells than the large scale surveys, their sensitivity to large 
scale structure is not reduced as much as suggested by Ta- 
ble|2l because their cells are not fully sampled. This can also 
be seen from the number of galaxies that the different surveys 
detect at each H I mass. The ADBS detected seven galaxies 
with Mhi < 10^ Mq, whereas the BGC has 38 galaxies in that 
mass range. Surely, the larger search volume and higher num- 
ber of galaxies in the BGC must reduce its susceptibility to 
the effects of large scale structure. 

4.2. Comparison with other methods 

Here we compare the results of the 2DSWML analysis of 
the BGC with those from other, more conventional estimators. 
Figure^Jshows the results of the standard l/Vmax method rep- 
resented by open circles, and the standard SWML application 
to a integrated flux limited subsample {S\at > 25 Jykms"') is 
shown by grey triangles. For comparison, the 2DSWML so- 
lution is also reproduced in this figure by a solid line. The 
lower panel of Figure ^| shows the averaged V/Vmax values 
for each bin in Hi mass. The circles and triangles represent 
the total BGC and the integrated flux-limited subsample, re- 
spectively, and values of Vmax are calculated on the bases of 
their respective selection criteria. 

The HiMFs from the 2DSWML method and the standard 
1/Vmax method are in very good agreement. This is partic- 
ularly striking since the latter method makes no corrections 
for the effects of large scale structure which are obviously 
present in our data (see Koribalski et al. 2003). The mean 
value of y/Vmax for the l/Vmax method is 0.503 ± 0.009, sup- 
porting the fact that the sample is statistically complete and 
that the effects of large scale structure average out over the 
whole sample. Zwaan et al. (1997) tested the effects of den- 
sity variations on their l/Vmax determination of the HiMF and 
also concluded that their calculation was insensitive to large 
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Table 1 . Biases in the HIMF determination 



Bias 


Effect on HiMF 


Effect on ^hi 


Selection bias 


overall increase (Ri 10%) 


< 15% over 


Eddington effect 


steepening (Act —0.05) 


8% over 


H I self-absorption 


overall decrease —10%) 


< 15% under 


Minimal velocity width 


flattening (Aa ss 0.03) 


1 % under 


Confusion 


increase in M^j (Ri 15%) and decrease in 9* (Ri 15%) 


none 


Cosmic variance 


small error in shape 


uncertainty ~ 10% 



Table 2. Number of cells and volumes probed by blind H i surveys 



Sample 


Mm 


~ 10 


*M0 


Mm 




10' M© 


Mm 


~ 10 


'"Mo 




10^ 


20' 


volume 


103 


20^ 


volume 


I03 


203 


volume 


HIPASS BGC 


2 


1 


5 


30 


4 


90 


1500 


190 


1600 


AHISS 


60 


15 


0.35" 


200 


50 


1.6" 


200 


50 


1.6" 


Arecibo Slice 


3 


1 


0.25 


30 


8 


4.2 


50 


12 


6.5 


HIPASS sec 


2 


1 


0.6 


10 


2 


10 


200 


20 


180 


HIPASS HIZSS 


2 


1 


0.25 


14 


4 


4.7 


100 


24 


84 


ADBS 


20 


5 


0.5 


120 


20 


8.6 


700 


100 


51 



Note. — For each H I mass, the first column gives the number of probed 10^ Mpc^ cells and the second column gives the number of probed 20^ Mpc"^ cells. The third column gives the 
approximate search volumes in 1000 Mpc^. All search volumes are calculated using the quoted rms noise levels in the respective papers and assume optimal smoothing and a relation 
between velocity width and H I mass as given in Zwaan et al. (1997). 
^ Not counting sidelobes 



scale structure. For Hi masses < 10 M©, the l/Vmax method 
finds higher space densities than the 2DSWML method. This 
is most likely the result of the fact that these galaxies are all 
drawn from the very local universe (< 10 Mpc, see FigurelTTl. 
which we know to be overdense. Also the Poisson errors on 
these points are very high since they are based on small num- 
bers of galaxies. 




7 8 9 10 11 

log M^^, [Mg] 



Fig. 10. — Comparison of different HiMF estimators. The top panel shows 
the HiMF of BGC galaxies derived via the VVm^^ method as open circles 
and the standard SWML HiMF of an integrated flux limited subsample as 
grey triangles. The 2DSWML HiMF from Figurel^is reproduced as a solid 
line. The lower panel shows the median values of V/Vmax in each bin, with 
Icr uncertainties. Open circles again represent the 1/Vma,ii method and grey 
triangles the standard SWML method. 

The SWML HiMF is also in good agreement with both 



other methods, but the uncertainties on the space densities for 
Mhi < 10** Mq are large because the integrated flux limited 
subsample contains only a small number of low mass galax- 
ies. The mean V/Vmax value is 0.500 ±0.013, which suggests 
that the integrated flux limited subsample is complete. 




D [Mpc] 



Fig . 11. — Hi masses of BGC galaxies as a function of their distance. The 
solid line indicates an integrated flux limit of 25 Jy Ions"' above which the 
sample is 'complete'. 



5. COMPARISON TO OTHER SURVEYS 

In the previous section we have shown that the results on 
the HiMF are internally consistent. We now compare the re- 
sults with those of previous Hi surveys. In Figure[2]we re- 
produce the best fit Schechter function to the BGC HiMF as 
a thick solid line. In addition we plot the HiMFs from the 
Arecibo Hi Strip Survey (AHISS, Zwaan et al. 1997, dot- 
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ted line), the Arecibo Slices (AS, Schneider et al. 1998), the 
Arecibo Dual Beam Survey (ADBS, Rosenberg & Schneider 
2002, dashed line) and the HIPASS South Celestial Cap (SCC, 
Kilborn 2000, long-dashed line). The ranges over which the 
Schechter function have been plotted indicate the H I mass 
bracket within which the HiMFs have been reliable deter- 
mined from the different surveys. All curves have been con- 
verted to H() = 75kms~'Mpc~'. The Schechter parameters of 
the various HiMFs are given in Table|3] 

All curves agree very well at the high mass end, which 
can also be seen from the value of logM^j, which is « 9.8 
for all surveys. Discrepancies arise at the low mass end, 
where Zwaan et al. (1997) found a = -1.2 and Rosenberg 
& Schneider found a = -1.53. Undoubtedly, part of these 
differences can be explained by small number statistics, but 
different treatments of the survey completeness and different 
HiMF estimators may also cause discrepancies. 
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Fig. 12.— Comparison of the HIPASS BGC HiMF with previous calcu- 
lations from blind H I surveys. Shown are HiMFs from the AHISS (Zwaan 
et al. 1997), the AS (Schneider et al. 1998), the HIPASS SCC (Kilborn 
2000), the HIPASS HIZSS (Henning et al. 2000), and the ADBS (Rosen- 
berg & Schneider 2002). All mass functions have been converted to the same 
value of H(i = 75 kms"' Mpc"' and are only plotted over the H I mass range 
in which they have been measured. 

The AHISS (Zwaan et al. 1997) and the ADBS (Rosen- 
berg & Schneider 2002) are both drift-scan surveys performed 
with the Arecibo telescope. The AHISS is more sensitive 
with an 1(7 rms noise of ^ 0.75 mJy/beam compared to ^ 3.5 
mJy/beam per 32 kms"' for the ADBS, but the sky coverage 
of the ADBS was much larger which is reflected in the number 
of detections: 66 in the AHISS and 265 in the ADBS. Both 
galaxy samples are based on by-eye examinations of the H I 
spectra, which means that the completeness limits have to be 
calculated a posteriori. Assessing the completeness limits of 
the drift-scan surveys is complicated because the sensitivity 
is a function of declination offset, which is particularly im- 
portant if side-lobe detections have to be taken into account 
(AHISS). Zwaan et al. (1997) derived an analytical expres- 
sion for the 'detectability' of sources, Rosenberg & Schneider 
(2002) used a large number of synthetic sources to test their 
completeness. One of the differences in the analysis of both 
surveys is the treatment of the variation of sensitivity S with 
velocity width, which can be expressed as 5 oc AV'^. Assum- 
ing optimal smoothing and uncorrected noise, the expected 
value of f3 is 0.5, which was adopted by Zwaan et al. (1997). 
Rosenberg & Schneider (2002) argue that /3 = 0.75 which 



leads to a reduced sensitivity to large line width sources, and 
hence, via the Tully-Fisher relation, to a reduced sensitivity to 
high H I mass sources. 

With a peak flux limit of 1 16 mJy, the BGC is a relatively 
shallow survey compared to the ADBS and the AHISS. How- 
ever, thanks to the BGC's large sky coverage, the number 
counts at the low mass end are better than those from AHISS 
and ADBS, and the BGC HiMF extends to lower H 1 masses 
than either of the Arecibo surveys. This low sensitivity causes 
all low mass galaxy detections to be at small distances (see 
Figure II 1> . which implies that the Eddington effect might 
have a significant influence on the HlMF shape. 

Davies et al. (2001) recently advocated an HlMF slope of 
a « -2, which they derived by extracting H I spectra from the 
HIPASS public release data base, and compare the distance 
distribution of detections with what would be expected for 
different HiMFs. This method intrinsically assumes that a re- 
lationship exists between distance and the minimal detectable 
H 1 mass at that distance. We have argued in Section l3 3.1l that 
this assumption does not hold for H I selected galaxy samples. 
The validity of the Davies et al. (2001) result is therefore un- 
clear 

6. DEPENDENCE OF THE HlMF ON GALAXY TYPE 

The BGC was cross-correlated with the LEDA data base to 
find matches with cataloged galaxies (see Jerjen et al. 2003). 
In addition to this, Ryan- Weber et al. (2002) searched for 
uncataloged BGC galaxies on the Digitized Sky Survey, and 
determined morphological types for these new galaxies. In to- 
tal, morphological type information is now available for 892 
galaxies out of our total sample of 1000. The 108 unclassified 
galaxies consist mainly of galaxies at low Galactic latitude, 
groups or pairs for which a unique match between H 1 signal 
and optical galaxy could not be made, or detections for which 
a cross correlation with LEDA galaxies is otherwise ambigu- 
ous. 

In order to determine the morphological type dependence 
on the HlMF we divide the BGC into five subsets of galaxies: 
E-SO, Sa-Sb, Sbc-Sc, Scd-Sd, and Sm-lrr. Not surprisingly 
for an HI selected sample, the subset of early-type galax- 
ies contains only 43 galaxies, which is marginally sufficient 
to calculate a meaningful HlMF. In addition, we calculate 
the HlMFs for galaxy samples divided into 'late' and 'early', 
where galaxy types later than Sb are regarded as 'late'. 

Figure [T3I shows the Hi mass functions for different mor- 
phological types. We have chosen to apply the l/Vmax method 
here, because the subsamples are uncomfortably small for a 2- 
dimensional analysis. For reference, the best fit HlMF for the 
total BGC is indicated by a dotted line in each panel. Best fit 
Schechter functions to the individual HiMFs are also shown, 
but note that for the E-SO subsample the normalization and 
are very poorly defined. All HlMFs are multiplied by 
1000^892 to correct for the incompleteness in morphological 
classification. Tablel^includes the best-fit Schechter parame- 
ters for all morphological types. 

There is no clear variation in the HlMF slope between types 
Sa and Sd, the HlMF is flat with a -1.0. Only for galax- 
ies in the Sm-lrr bin, we see a steepening of the HlMF to 
a = -1.4. This effect is very similar to what is found for op- 
tical luminosity functions (Marzke et al. 1998). Zwaan et 
al. (2001) used the Marzke et al. (1998) luminosity functions 
and fitted relations between H I mass and optical luminosity 
to derive type specific Hi mass functions. These are in good 
agreement with what is found here, and also show a steep- 
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Table 3. Schechter parameters of H i mass functions 



Sample 


a 




e* 

[lO^/i^jMpc-^] 


Reference 


HIPASS BGC 


-1.30 


9.79 


86 


this paper 


AHISS 


-1.20 


9.80 


59 


Zwaan et al. (1997) 


HIPASS sec 


-1.52 


10.1 


32 


Kilborn (2000) 


HIPASS HIZSS 


-1.51 


9.70 


60 


Hemiing et al. (2000) 


ADBS 


-1.53 


9.88 


50 


Rosenberg & Schneider (2002) 



Table 4. Type-specific H i mass functions 



Type 


a 




e* 






[/lyiMo] 




E-SO 


-1.78 


10.2" 


0.92" 


Sa-Sb 


-0.75 


9.65 


22 


Sbc-Sc 


-1.00 


9.77 


49 


Scd-Sd 


-1.01 


9.42 


46 


Sm-In' 


-1.41 


9.32 


61 


late 


-1.33 


9.75 


78 


early 


-1.19 


9.76 


21 



"Values uncertain because a Schechter function is a poor fit to the data points 



ening for the very late-type galaxies. Apparently, both the 
number density of low luminosity and low H I mass galaxies 
are dominated by late-type galaxies. 

Another conclusion from Figure^]is that the characteristic 
mass MJ^[ is much higher for types Sbc-Sc than for the other 
types. This reflects that most galaxies with high H 1 masses 
are of types Sbc-Sc, and that these types are the most domi- 
nant contributors to the total H I mass density. 



J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l_ 




- E-SO - 
"I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 PI" 
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- So-Sb - 
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Fig. 13. — HI mass function for different morphological types. The solid 
lines show Schechter fits to the type-specific HiMFs and the dashed line is 
HiMF for the total sample. In the lower right panel, late-type galaxies are all 
types later than Sb. 



Similarly, the HiMF can be divided into galaxies of differ- 
ent optical surface brightness. The question of how much low 
surface brightness (LSB) galaxies contribute to the local H I 
density and the total local baryon budget has been addressed 
before by several authors. Based on local galaxy samples 
with targeted 21cm spectroscopic follow-up, Briggs (1997a, 
1997b) concluded that LSB galaxies make a ^ 10% addition 
to the total Hi mass density. Zwaan, Briggs & Sprayberry 
(2001) used the AHISS to find that galaxies with central sur- 
face brightness > 23.0 mag arcsec^ contribute 18% to the Hi 
density. The luminosity density contained in LSB galaxies is 
also found to be low (Sprayberry et al. 1997, Driver 1999, de 
Jong & Lacey 2000, Zwaan et al. 2001). 

Measurements of optical surface brightness for the BGC 
are drawn from LEDA. We choose to use the mean effec- 
tive surface brightness fi^ff, which is defined as the mean sur- 
face brightness inside the aperture enclosing one-half the to- 
tal light. Unfortunately, the measurements are not complete: 
only for 600 galaxies in the BGC sample is /ieff available. The 
uncertainties in the HiMFs for LSB and high surface bright- 
ness (HSB) galaxies are therefore large. We divide the sample 
into two subsamples, where we use /ieff= 24.0 mag arcsec"^ as 
the boundary between a LSB and a HSB galaxy. This sepa- 
ration results in 96 LSB and 504 HSB galaxies. For a expo- 
nential disk, the value of /ieff = 24.0 mag arcsec"^ compares to 
a central surface brightness of /ieff = 22.2 mag arcsec"-. The 
values of /ieff are uncorrected for inclination and dust extinc- 
tion. 




log M„, [Mq] 

Fig. 14. — HI mass function for low surface brightness and high surface 
brightness galaxies. The HiMF for the total sample is drawn as a dashed 
curve. 

FigureOshows HiMFs for the LSB and HSB subsamples. 
Both are corrected by a factor 1000^600 to account for the 
incompleteness in surface brightness measurements. Simi- 
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larly to what was found by Briggs (1997a), we see that the 
HiMF for LSB galaxies is steeper (a = -1.35) than that for 
HSB galaxies (a = -1.2). The obvious reason for this is that 
there is a relation between surface brightness and H I mass, 
in the sense that LSB galaxies have lower total H I masses. 
LSB galaxies therefore increasingly populate the lower Hi 
mass bins, resulting in a steeper H I mass function compared 
to HSB galaxies. Nonetheless, LSB galaxies contribute only 
very little to the total H I mass density. Based on the Schechter 
fits we find that this contribution is ^ 15%. 

7. THE Hi MASS DENSITY 

Figure [21 shows the H I mass density pui contained in 
galaxies of different H I mass. The heavy solid line shows 
the converted best-fit Schechter function, and the dotted and 
dashed lines show the H I mass densities derived by Zwaan et 
al. (1997) and Rosenberg & Schneider (2002), respectively. 
The bivariate Hi mass density distribution in the (MeijWio) 
plane is shown in Figure This figure clearly shows that 
the gas mass density is dominated by galaxies with H I masses 
around M^j and velocity widths of ^ 250 kms"'. 




Fig. 15. — The distribution of H I mass density over different H I masses. 
The points show the measured mass density derived by multiplying the 
2DSWML HiMF by Mhi. The solid line is the converted best fit Schechter 
function. The dotted line is the mass density distribution function from 
Zwaan et al. (1997) and the dashed Hne is that derived by Rosenberg & 
Schneider (2002). 

The total H l mass density contained by galaxies in the lo- 
cal universe is calculated with pei = 0*r(2 + a)M^^, and is 
found to be Phi = (6.9 ± 1.1) X 10^/175 MqMpc"^ A straight 
summation of Mhi0(Mhi) using the points in Figure0]gives a 
sUghtly lower value of pei = (6.6 ± 1.1) x 10^/175 MqMpc"^ 
because this calculation does not include the extensions of the 
Schechter fit beyond the region where we can measure it re- 
liably. Taking into account all biases as summarized in Ta- 
bleffl we find pui = (6.1 ± 1.0) x 10'/z75MoMpc-^ or (4.1 ± 
0.7) X lO'^^'gcm"-', where the error is the Icr uncertainty de- 
rived with the jackknife method, which includes random er- 
rors and the effects of large scale structure. For comparison, 
Zwaan et al. (1997) found pm = 4.3 x lO^/z75M0Mpc-^ and 
using Rosenberg & Schneider's (2002) Schechter parameters 
we find Phi = 7 . 1 X 1 0^ Mq Mpc"^^ . Briggs ( 1 990) used lu- 
minosity functions and a conversion factor from luminosity to 
Hi mass to find pei = 7.0 x 10^/i75MqMpc"-^, a value very 
close to our measurement. 

Converting pui to the more convenient f^ei, the H I mass 



density as a fraction of the critical density of the universe, 
we find ilni = (3.8 ±0.6) x lO'^/z^g and after making a cor- 
rection for the 24% helium mass fraction we find ilatomic = 
(4.8 ±0.8) X 10~'*/z7g. This measurement of ilui is very ro- 
bust because the galaxies that contribute most to the gas den- 
sity are also the ones that dominate the counting statistics 
(Mhi = 10''- 10'° M0). Uncertainties in the faint-end slope 
a therefore contribute little to the total error in Ohi . Further- 
more, since these galaxies are found at large distances, and 
therefore over a large region of the sky, the effects of large 
scale structure are unimportant. 
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Fig. 16. — Bivariate H I mass density distribution in the (Mhi, W20) plane. 
The greyscales ai'e on a logarithmic scale and represent the mass density in 
MqMpc"' per decade of Mm and decade of n'20. The Hi mass density is 
dominated by high mass, high velocity width galaxies. 

Fukugita, Hogan & Peebles (1998) estimated that the mass 
fraction in molecular gas is approximately 80% of that of 
atomic gas. Keres, Yun & Young (2002) measure the CO lu- 
minosity function and derive that the mass fraction in molecu- 
lar gas is approximately 60%. Averaging these two values, we 
find that the total mass density in cool gas in the local universe 
is Ocooigas = 7.6 X 10~*hjl. To put these numbers into per- 
spective, the most recent determination of the baryon density 
via the primeval deuterium abundance is ilbaryon/!75 = 0.035 ± 
0.004 (Buries, Nollett & Turner 2001). Microwave back- 
ground anisotropy measurements give slightly higher values 
(e.g., de Bernardis et al. 2000). We estimate from this that 
cool gas in galaxies makes up approximately 2% of the total 
baryon density in the local universe. The total mass in diffuse 
ionized intergalactic gas is much higher than this. From low 
redshift HST spectra Penton, Shull, & Stocke (2000) derived 
that the mass density in the Lya forest is approximately 20% 

of r^baiyon ■ 

Our derived value of fini is approximately 5 times lower 
than that at redshifts 2-4 (Ellison et al. 2001, Peroux et al. 
2001, Storrie-Lombardi, McMahon, & Irwin 1996). A grad- 
ual conversion from neutral gas to stars in the disks of galaxies 
is generally believed to cause this decline in r^Hi (Lanzetta et 
al. 1995, Pei, Fall & Hauser 1999), but recent results on high 
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column density QSO absorption-line systems at intermediate 
redshifts have confused this picture. Lane (2000) searched for 
intermediate z 21cm absorption in Mgll selected systems, and 
used Mgll statistics to bootstrap the flui values. The same 
statistics was used by Rao & Turnshek (2000) for a Ly-a sur- 
vey of z < 1.65 Mgll selected systems. Both authors find 
values of flmiz < 1.65) consistent with those at higher red- 
shifts, indicating that the neutral gas density does not evolve 
strongly from high z to the present. Churchill (2001) recently 
performed an unbiased survey for low-z Mgll systems and 
used the same statistics for Mgll systems as Rao & Turnshek 
did, and derived flmiz = 0.05), which is a factor 5 larger than 
our results. The uncertainties on all these measurements are 
very large as they suffer from small number statistics. The 
evolution of ilni from z = 2 to the present time therefore re- 
mains very uncertain and more intermediate redshift optical 
and 21cm surveys are required to constrain the evolution of 
Hi. 



Vlg [km/s] 



2000 



4000 



6000 




40 60 
D [Mpc] 

Fig. 17. — Distance distribution of tlie BGC. Tlie thick histogram shows 
the measured redshift distiibution of galaxies in the BGC, the smooth dashed 
curve is the predicted distribution based on the selection function S(D), which 
is calculated with the 2DSWML method. Both distributions are normalized 
to AD = 1 Mpc. The thin lines show the same for a subsample with 5jnt > 
25 Jy km s"' . 



8. THE BGC SELECTION FUNCTION 



As described in Section l3331 the 2DSWML method allows 
us the calculate the selection function of the BGC. The selec- 
tion function is an important input into calculations of cluster- 



ing in the BGC, like the 2-point correlation function (Meyer 
et al. 2003b). It is therefore interesting to test whether the 
calculated selection function S(D) is a valid approximation of 
the true selection of the survey. 

In Figure[n]we compare the measured redshift distribution 
of BGC galaxies with the distribution implied by the derived 
selection function. The overall redshift distribution agrees 
well with the calculated smooth curve. This is also reflected 
in the predicted number counts of galaxies from the selection 
function, which is 0.0456 galaxies/degree^. This is only 6% 
lower than the measured number count in the BGC of 0.0485 
galaxies/degree^. This implies that if we had used the number 
counts to normalize the HiMF similar to what is occasionally 
done for optical LFs (e.g., Norberg et al. 2001), we would 
have found essentially the same normalization. 

By comparing N{D) with the smooth curve, two overden- 
sities become apparent: one at 22 Mpc and one at ^ 36 
Mpc. See Koribalski et al. (2003) for a detailed discussion of 
the large-scale structure in the BGC. Also shown in FigureFTTl 
is the redshift distribution of the integrated flux limited sub- 
sample, and the predicted distribution calculated with Eg. 1141 
The same density structure can be seen here, although less 
pronounced than for the full galaxy sample. 

9. CONCLUSIONS 

We have used the HffASS Bright Galaxy Catalog (BGC, 
Koribalski et al. 2003) consisting of the 1000 southern galax- 
ies with highest peak flux densities, to measure the H 1 mass 
function (HiMF) of galaxies in the local universe. This 
is the largest sample of galaxies ever used to measure the 
HiMF, and contains galaxies over the H 1 mass range from 
log(MH/M0) + 21og/i75 = 6.8 to 10.6. We develop a bi- 
variate stepwise maximum likelihood method to measure the 
HiMF. This method solves for H 1 mass and velocity width, 
and then integrates over velocity width to find the HiMF. 
We show that this method is a reliable estimator and insen- 
sitive to the effects of large scale structure. The resulting 
HiMF can be well fit with a Schechter function with param- 
eters a = -1.30 ±0.08, log(M*/MQ) = 9.79 ± 0.06/175^ and 
e* = (8.6 ±2.1) X 10-^/1^5 Mpc"l We find that the faint-end 
slope of the HiMF is dependent on morphological type. Late- 
type galaxies show the steepest faint-end slopes and these 
galaxies dominate the statistics at low H 1 masses. We exten- 
sively test the influence of possible biases in the HlMF deter- 
mination, including peculiar motions of galaxies, inclination 
effects, selection biases and large scale structure, and quan- 
tify these biases. The integral H 1 mass density in the local 
universe is found to be pm = (6.1 ± 1.0) x 10^/175 MoMpc"^^, 
contributing a fraction flm = (3.8 ± 0.6) x 10~^/i75 of the crit- 
ical density of the universe. 



REFERENCES 



Babul, A. & Ferguson, H. C. 1996, ApJ, 458, 100 
Babul, A. & Rees, M. J. 1992, MNRAS, 255, 346 
Barkana, R. & Loeb, A. 1999, ApJ, 523, 54 
Barnes, D. G. et al. 2001, MNRAS, 322, 486 
Blanton, M. R. et al. 2001, AJ, 121, 2358 

Bottinelli, L., Gouguenheim, L., Fouque, R, & Paturel, G. 1990, A&AS, 82, 
391 

Braun, R. 1997, ApJ, 484, 637 

Biiggs, F H. 1990, AJ, 100, 999 

Briggs, F H., & Rao, S. 1993, ApJ, 417, 494 

Briggs, F H. 1997a, ApJ, 484, 618 



Biiggs, F H. 1997b, ApJ, 484, L29 

Buries, S., Nollett, K. M., & Turner, M. S. 2001, ApJ, 552, LI 
Churchill, C. W. 2001, ApJ, 560, 92 

Corbelli, E., Salpeter, E. E., Bandiera, R., 2001, ApJ, 550, 26 
Davis, M. & Huchra, J. 1982, ApJ, 254, 437 

Davies, J. I., de Blok, W. J. G., Smith, R. M., Kambas, A Sabatini, S., Linder, 

S. M., & Salehi-Reyhani, S. A. 2001, MNRAS, 328, 1151 
de Bemardis, R et al. 2000, Nature, 404, 955 
de Jong, R. S. & Lacey, C. 2000, ApJ, 545, 781 
Dekel, A. & Silk, J. 1986, ApJ, 303, 39 

Dickey, J. M., Mebold, U., Stanimirovic, S., & Staveley-Smith, L. 2000, ApJ, 
536, 756 



16 



Zwaan et al. 



Driver, S. P. 1999, ApJ, 526, L69 

Efstathiou, G., Ellis, R. S., & Peterson, B. A. 1988, MNRAS, 231, 479 
ElUson, S. L., Yan, L., Hook, I. M., Pettini, M., Wall, J. V., & Shaver, P 2001, 

A&A, 379, 393 
Folkes, S. et al. 1999, MNRAS, 308, 459 

Fukugita, M., Hogan, C. J., & Peebles, P. J. E. 1998, ApJ, 503, 518 
Gibson, S. J., Taylor, A. R., Higgs, L. A., & Dewdney, P E. 2000, ApJ, 540, 
851 

Haynes, M. R & Giovanelli, R. 1984, AJ, 89, 758 

Huizinga, J. E. & van Albada, T. S. 1992, MNRAS, 254, 677 

Henning, R A. et al. 2000, AJ, 119, 2686 

Hoyle, F. & Vogeley, M. S. 2002, ApJ, 566, 641 

Jerjen, H. et al. 2003, in preparation 

Jing, Y. P, Bomer, G., & Suto, Y. 2002, ApJ, 564, 15 

Keres, D., Yun, M., S., & Young, J. S. 2002, ApJ, in press 

Kilbom, V. A. 2000, Ph.D. Thesis, University of Melbourne 

Kilbom, V. A. et al. 2002, AJ, 124, 690 

Knapp, G. R. 1974, AJ, 79, 527 

Koribalski, B. S. et al. 2003, submitted to AJ 

Kraan-Korteweg, R. C, van Driel, W., Briggs, P., Binggeli, B., & Mostefaoui, 

T. I. 1999, A&AS, 135, 255 
Lane, W. M. 2000, Ph.D. Thesis, University of Groningen 
Lang, R. H. et al. 2002, MNRAS, in press 

Lanzetta, K. M., Wolfe, A. M., & Turnshek, D. A. 1995, ApJ, 440, 435 
Loveday, J. 2000, MNRAS, 312, 557 

Lupton, R. 1993, Statistics in Theory and Practise (Princeton: Princeton Univ. 
Press) 

Marzke, R. O., da Costa, L. N., Pellegrini, P S., Willmer, C. N. A. & Geller, 

M. J. 1998, ApJ, 503, 617 
Meyer, M. J., Zwaan, M. A., Webster, R. L., et al. 2003a, in prep. 
Meyer, M. J. et al. 2003b, in prep. 
Norberg, P et al. 2001, astro/ph-01 11011 
Pei, Y. C, Fall, S. M., & Hauser, M. G. 1999, ApJ, 522, 604 



Penton, S. V., Shull, J. M., & Stocke, J. T. 2000, ApJ, 544, 150 

Peroux, C. ., Irwin, M. J., McMahon, R. G., & Storrie-Lombardi, L. J. 2001, 

Astrophysics and Space Science Supplement, 277, 551 
Plionis, M. & Basilakos, S. 2002, MNRAS, 330, 399 
Rao, S. M. & Turnshek, D. A. 2000, ApJS, 130, 1 
Roberts, M. S. & Haynes, M. P 1994, ARA&A, 32, 115 
Rosenberg, J. L. & Schneider, S. E. 2002, ApJ, 567, 247 
Rosenberg, J. L. & Schneider, S. E. 2000, ApJS, 130, 177 
Ryan-Weber, E. et al. 2002, AJ, 124, 1954 
Ryder, S. D. et al. 2001, ApJ, 555, 232 

Sandage, A., Tammann, G. A., & Yahil, A. 1979, ApJ, 232, 352 
Schechter, P 1976, ApJ, 203, 297 

Schlegel, D., Davis, M., Summers, R, & Holtzman, J. A. 1994, ApJ, 427, 527 
Schmidt, M. 1968, ApJ, 151, 393 

Schneider, S. E., Spitzak, J. G., & Rosenberg, J. L. 1998, ApJ, 507, L9 
Sheth, R. K. & Diaferio, A. 2001, MNRAS, 322, 901 

Staveley-Smith, L. et al. 1996, Publications of the Astronomical Society of 
Australia, 13, 243 

Sprayberry, D., Impey, C. D., Irwin, M. J., & Bothun, G. D. 1997, ApJ, 482, 
104 

Storrie-Lombardi, L. J., McMahon, R. G., & Irwin, M. J. 1996, MNRAS, 
283, L79 

Storrie-Lombardi, L. J. & Wolfe, A. M. 2000, ApJ, 543, 552 

Strauss, M. A., Ostriker, J. P, & Cen, R. 1998, ApJ, 494, 20 

Tully, R. B., & Fisher, J. R., 1977, A&A, 54, 661 

Verheijen, M. A. W. & Sancisi, R. 2001, A&A, 370, 765 

Willick, J. A., Strauss, M. A., Dekel, A., & Kolatt, T. 1997, ApJ, 486, 629 

Willmer, C. N. A. 1997, AJ, 1 14, 898 

Zwaan, M. A., Briggs, R H., & Sprayberry, D. 2001, MNRAS, 327, 1249 
Zwaan, M. A., Briggs, R. H., Sprayberry, D., & Sorar, E. 1997, ApJ, 490, 173 
Zwaan, M. A., Verheijen, M. A. W., & Briggs, R. H. 1999, Publications of the 
Astronomical Society of Australia, 16, 100 



